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Abstract 



Ch ' We consider the joint source-channel coding problem of sending a Gaussian source over a K-iiser Gaussian 

broadcast channel with bandwidth mismatch. A new outer bound to the achievable distortion region is 
' derived using the technique of introducing more than one additional auxiliary random variable, which was 

previously used to derive sum-rate lower bound for the symmetric Gaussian multiple description problem. 
By combining this outer bound with the achievability result based on source-channel separation, we provide 
approximate characterizations of the achievable distortion region within constant multiplicative factors. 
t/3 , Furthermore, we show that the results can be extended to general broadcast channels, and the performance 

^ ' of the source-channel separation based approach is also within the same constant multiplicative factors of 

the optimum. 

> 

m 

^ 1 Introduction 

\^ Shannon's source-channel separation theorem essentially states that asymptotically there is no loss from op- 
timum by decoupling the source coding component and channel coding component in a one-to-one commu- 
<^ nication system [Ij. This separation result tremendously simplifies the concept and design of communication 
K*" ■ systems, and it is also the main reason for the division between research in source coding and channel cod- 
^ ing. However, it is also well known that in many multi-user settings, such a separation indeed incurs certain 
' performance loss; see, e.g., [2l|3l[lj. For this reason, joint source-channel coding has attracted an increasing 
amount of attention as the communication systems become more and more complex. 

One of the most intriguing problems in this area is the joint source-channel coding of a Gaussian source 
over a Gaussian broadcast channel with K users under the average power constraint. It was observed by 
Goblick [2] that when the source bandwidth and channel bandwith are matched, i.e., one channel use per 
source sample, directly sending the source samples on the channel after a simple scaling is in fact optimal, but 
the separation-based scheme suffers a performance loss [3]. However, when the source bandwidth and channel 
bandwidth are not matched, such a simple scheme is no longer optimal. Many researchers have considered this 
problem, and significant progress has been made toward finding better coding schemes based on hybrid digital 
and analog signaling; see, e.g., O [U [71 [H [9l [TO] and the references therein. 

In spite of the progress on the achievability schemes, our overall understanding on this problem is still 
quite limited. As pointed our by Caire [11], the key difficulty appears to be finding meaningful outer bounds. 
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Such outer bounds not only can provide a concrete basis to evaluate various achievability schemes, but also 
may provide insight into the structure of good or even optimal codes, and may further suggest simplification 
of the possibly quite complex optimal schemes in certain distortion regimes. In this regard, the result by 
Reznic et al. [8] is particularly important, where they derived a non-trivial outer bound for the achievable 
distortion region for the two-user system. This outer bound relies on a technique previously used in the multiple 
description problem by Ozarow [12j , where one additional random variable beyond those in the original problem 
is introduced. The bound given in [8j is however rather complicated, and was only shown to be asymptotically 
tight for certain high signal to noise ratio regime. 

In this work, we derive an outer bound for the ET-user problem using a similar technique as that used 
in [8], however, more than one random variable is introduced. The technique used here also bears a certain 
similarity to that used in [13]. The outer bound has a more concise form than the one given in [H], but for 
the K = 2 case, it can be shown that they are equivalent. In fact the outer bound is a set of outer bounds 
parametrized hy K—\ non-negative variables. Though one can optimize over these variables to find the tightest 
one, this optimization problem appears difficult. Thus we take an approach similar to the one taken in [13j . 
and choose some specific values for the K — 1 variables. Moreover, by combining the outer bound with the 
simple achievability scheme based on source-channel separation, we provide approximate characterization^] 
of the achievable distortion region within some universal constant multiplicative factors, independent of the 
signal to noise ratio and the bandwidth mismatch factoi§|. In one of the approximation, the multiplicative 
factor is roughly of form 2'^ for the distortion at the k-th user, while in the other, the factor is K for all the 
distortions. These results show although Shannon's source-channel separation result does not hold strictly in 
this problem, it indeed holds in an approximate manner. In fact, this set of results is extremely flexible, and 
can be applied in the case with an infinite number of users but the minimum achievable distortion is bounded 
away from zero, for which we can conclude that the source-channel separation approach is also within certain 
finite constant multiplicative factors of the optimum. In this case, these constants can be upper bounded by 
factors relating the disparity between the best and worse distortions, which is not influenced by the number 
of users being infinite. 

Though the outer bound is derived using techniques that have some precedents in the information theory 
literature, the difficult lies in determining which terms to bound. In contrast to pure source coding problems 
or pure channel coding problems, where we can usually meaningfully bound the linear combination of rates, 
in a joint source-channel coding problem the notion of rates does not exist. In [8], the lower bound on one 
distortion is given in terms a function of the other distortion in the two-user problem. It is clear that such 
proof approach becomes unwieldy to generalize to the K-usei case; in fact, the expression given [8] is quite 
involved even for K = 2. In this work, we instead derive bounds for some quantities which at the first sight 
may seem even unrelated to the problem, but in fact serve as an interface between the source and channel 

^We would like to thank David Tse for discussions in ITA 2008 on the formulation of the question, where he was motivated by 
his solution to a deterministic version of this problem. 

■^We would again like to thank David Tse as well as one of the anonymous reviewers for ISIT 2009, for the observation that 
using the point to point distortion bound on each user, one can show that the loss by the source-channel separation approach is 
within certain multiplicative factors exponential in the bandwidth mismatch factor. However, our result here is more flexible, and 
also much stronger for bandwidth mismatch factor larger than one, and particularly so for large bandwidth mismatch factors. 
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coding components, thus replacing the role of "rates" in traditional Shannon theory proofs. 

Inspired by a recent work of Avestimehr, Caire and Tse [14J, where source-channel separation in more 
general networks is considered, we further show that our technique can be conveniently extended to general 
broadcast channels, and source-channel separation based scheme is within the same multiplicative constants 
of the optimum as for the Gaussian channel case. Our technique of introducing additional auxiliary random 
variables is quite flexible and powerful, and as a result it yields sharper bounding constants. 

The rest of the paper is organized as follows. Section [2] gives the necessary notation and reviews an 
important lemma useful in deriving the outer bound. The main result is presented in Sectional and the proofs 
for these results are given in Section [H The approximation result for general broadcast channels is given in 
Section [5l Section [6] concludes the paper. 

2 Problem Definition and Review 

In this section, we give a formal definition of the Gaussian source broadcast problem in the context of Gaussian 
broadcast channels; the notation will be generalized in Section [5] when other broadcast channels are considered. 

Let {S{i)}i=i^2,... be a memoryless and stationary Gaussian source with zero-mean and unit- variance. The 
vector (5(1), 5(2), S{m)) will be denoted as S"™. We use R to denote the domain of reals, and M+ to denote 
non-negative reals. The Gaussian memoryless broadcast channel is given by the model 



where is the channel output observed by the k-th receiver, and is the zero-mean additive Gaussian noise 
added on the channel input X. Thus the channel is memoryless in the sense that {Zi{i), Z2{i), ■ ■ ■ , ZK{i))i=i,2,... 
is a memoryless and stationary process. The variance of Z^ is denoted as N^, and without loss of generality, 
we shall assume 



The mean squared error distortion measure is used, which is given by d{s"^,s"^) = ^^^i{s{i) — s{i))'^. The 
encoder maps a source sample block of length m into a channel input block of length n, and each decoder 
maps the corresponding channel output block of length n into a source reconstruction block of length m. The 
bandwidth mismatch factor is thus defined as 



which is essentially the (possibly fractional) channel uses per source sample; see Fig. [TJ The channel input X 
is subject to an average power constraint. Throughout the paper, we use natural logarithm. 
We can make the codes in consideration more precise by introducing the following definition. 

Definition 1 An {m,n, P,di,d2, ■ ■ ■ ydx) Gaussian source- channel broadcast code is given by an encoding 



Yk = X + Zk, k=l,2,...,K, 




Ni>N2>...> Nk- 
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Figure 1: Joint source-channel coding with bandwidth mismatch. 



function 

f R", (4) 

such that 

1 " 

-^E(X(i))2<P, (5) 
1=1 

and K decoding functions 

3fc:M"^M™, k = l,2,...,K, (6) 

and their induced distortions 

dk = Ed(S-, gkifiS"') + Z]:)), k = l,2,...,K. (7) 

where E(-) is the expectation operation. 

Note that there are two kinds of independent randomness in the system, the first of which is by the source, 
and the second is by the channel, and the expectation operation is taken over both of them. In the definition, 
+ in the expression /(S*™) + Z-j^ is understood as the length-n vector addition. 

From the above definition, it is clear that the performance of any Gaussian joint source-channel code de- 
pends only on the marginal distribution of (S*™, X", 1^^"), but not the joint distribution {S"\ Y{^, ■ ■ ■ , Yj^) 
This implies that physical degradedness does not differ from statistical degradedness in terms of the system 
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performance. Since the Gaussian broadcast channel is always statistically degraded, we shall assume physical 
degradedness from here on without loss of generality. The channel noises can thus be written as 

Zfc = Zfe+i + AZfc, k= 1,2,3,..., K, (8) 

where AZ^ is a zero-mean Gaussian random variable with variance AA'^^ = Nk — N^+i, which is independent 
of everything else; for convenience, we define Zj^+i — 0, and it subsequently follows that ANk = Nk and 
Yk+i = X. 

Definition 2 A distortion vector {Di, D2, ■ ■ ■ , Dk) € , where 1 > Di > D2 > . . . > is achievable under 
power constraint P and bandwidth mismatch factor h, if for any e > 0, there exist integers m and n = hm and 
an (m, n, P, di, d2, ■ ■ ■ , dx) Gaussian source- channel broadcast code such that 

Di + e>di, i = l,2,...,K. (9) 

Note that the constraint 1 > Di > D2 > . . . > is without loss of generality, because otherwise the 
problem can be reduced to an alternative one with few users due to the assumed physical degradedness. The 
collection of all the achievable distortion vectors under power constraint P and bandwidth mismatch factor b 
is denoted by T>[P, b), and this is the region that we are interested in. 

One important result we need in establishing the main theorems of this work is the following lemma, which 
is a slightly different version of the one given in [13] . 

Lemma 1 Let W be a random variable jointly distributed with the Gaussian source vector S"™ in the alphabet 
W, such that there exists a deterministic mapping g -.W ^ M"* satisfying 

m{S'^,g{W)) <D. (10) 

Let U = S + V and U' = S + V + V' , where V and V' are mutually independent Gaussian random variables 
independent of the Gaussian source S and W , with variance cj^ and a'"^ , respectively. Then with a"^ = t and 
cj^ + o"'^ = t' , we have 

1. Mutual information hound 

/(T^;^'-)>_l0g_r_, (11) 

2. Bound on mutual information difference 

I(VF;C/™)-/(Ty;?7"") > -log^i^^4^^^^- (12) 

The proof of this lemma is almost identical to the one given in [13]. The only difference between the two 
versions is that in [13] the random variable W is in fact a deterministic function of S"™, however it is rather 
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straightforward to check that this condition was never used in the proof given in [TB] ; we include the proof of 
this lemma in the Appendix for completeness. 

3 Main Result for Gaussian Broadcast Channels 

Our main results for Gaussian source broadcast on Gaussian broadcast channels are the following theorem and 
its corollaries, the proofs of which are given in the next section; extension of these results to general broadcast 
channels are given in Section [5j 

Define the following region, which is in fact the inner bound via source-channel separation. 

V{P,b)^i^{Di,D2,...,DK):YlANkD;~' <P + N^, 1 > > ^2 > ■ • • > ^i^ > o| . (13) 

Next define the following regions, which in fact are outer bounds to the achievable distortion region. 



V*{P, 6) ^ I (Z?i, L»2, . . . , Dk) : Yl AiVfc(2'=Z?fc)-i < P + A^i, 1 > Di > D2 > . . . > Dk > 0^ 



(14) 



P(P,6) ^ I {DuD2, . . . ,Dk) ■Y.ANkiKDk)--^ < P + N^, I > > D2 > . . . > Dk > 

[ k=l 

(15) 

Theorem 1 

V{P, h) C V{P, h) c V*{P, h) nv{P, b). (16) 

The first outer bound 'D*{P, b) can be understood informally as follows. In the original channel the incre- 
mental channel noise powers are given by ANj., and it is obvious that the separation-based approach provides 
an inner bound to the distortion region. On the other hand, if the channel is enhanced such that the incremen- 
tal channel noise powers are given instead by 2~'^^^ANk, and at the same time the power is increased from P 
to P' = P + Ni — XIa^i then the performance of separation-based approach on this new channel in 

fact provides an outer bound to the distortion region. The meaning of the second outer bound is more obvious 
in conjunction with the inner bound. 

Theorem [T] is stated as inner and outer bounds to the achievable distortion region, however it can be 
observed that the bounds have similar forms, and their difference, in terms of distortions, can be bounded by 
some multiplicative constants. We state these results as corollaries next. 

To present the first corollary, we introduce an additional definition. For any 1 > -Di > D2 > • . . > Dk > 0, 
we associate with it a relaxed distortion vector (Dl,D2, . . . , D'^) and a binary labeling vector {Bi, B2, ■ ■ ■ , Bk) 
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m a recursive manner 



{Dk,Bk) = < , i+y-fc-io.^ ^ , , k = l,2,...,K. 17 

[ (2^+^i=i^^L>fe,l) otherwise 

where we have defined Dq = 1 for convenience. 

With this definition it is easily verified that > D^_^_-^ for k = 1,2, . . . , K — 1, and moreover < 
2 ^j=i ^ Dk- We are now ready to state the theorem. 

Corollary 1 Let {Dl, D^, ■ ■ ■ ,D*^) be the relaxed distortion vector of {Di, D2, Dk). If {Di, D2, . . . , Dk) G 
V{P,b), then {Dl,Dl,...,D*j^) e'D{P,b). 

To understand this theorem in a simpler manner, let us consider the case that > 2Dk-\-i for all k = 
1,2, . . . , K — 1. It is then clear that D'^ = 2^0^, and Corollary [1] essentially states that if {Di,D2, ■ ■ . , Dk) S 
V{P,b), then {2Di,2^ D2, . . . ,2^ Dk) G 'D{P,h). If on the other hand, the condition Dk > 2Dk+i is not 
always satisfied, the relaxed distortion vector is introduced to replace {2Di,2'^D2, ■ ■ ■ ,2^ Dk) which is not 
monotonically non-increasing as required by the definition of T>{P,b); nevertheless, in this case, the difference 
between the two distortion vectors is in fact smaller, being 2 ^ instead of 2 for D^- This result has 

the following intuitive interpretation: if a genie helps the separation-based scheme by giving each individual 
user half a bit information per source sample, and at the same time all the better users also receive this half 
a bit information for free, then the separation-based scheme is as good as the optimal scheme. 

Corollary [1] can in fact be used in the situation where there are an infinite number of users such as in a 
fading channel. Let the set of users indexed by x and their associated distortions be denoted as D^, since 
there may be an uncountable infinite number of them. If we apply the construction given in ()17p . with Bi 
replaced by Bx, supjDa;} taking the role of Di and m.i{Dx} taking the role of Dk, then the following lemma is 
straightforward by observing that s\ri>{D%} < 2 svl^{Dx}, inf{L>*} = 2^^ m.i{Dx}, and sup{D*} > inf{Z)*}. 



Lemma 2 The sequence B^ specified by ( (T7| j satisfies B^ < log2(sup{L'a;}) — log2(inf{D2:}) -|- 1. 

It is clear that the maximum multiplicative constant is less than 2^'^^^^'^ in the statement of Corollary [TJ 
If there exists a lower bound on the achievable distortion for the best user, denoted as djnin' which is strictly 
positive, i.e., \ni{Dx} > dmin > 0' then since suplD^.} < 1, the multiplicative factor is bounded as 

"^min 

Thus even when the number of users is infinite, as long as the lower bound dj^in bounded away from zero, 
the multiplicative factors are in fact finite. More formally, we have the following corollarjf]. 

^Here we directly take the number of users to infinity in Corollary [1] however a more rigorous approach is to derive the outer 
bounds for this case and show the result holds. This can indeed be done either along the line of the proof given in Section |4] with 
careful replacement of summation by integral, or more straightforwardly along the line of proof given in Section [5] 
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Corollary 2 For an infinite number of users indexed by x with inf|£)^}g-p(p.6) inf{-Da;} ^ ^mm' i^x} 
relaxed distortion vector of {Dx}. If {Dx} G 'D{P,b), then {D*} G T){P,b), and furthermore, sup{-^} < 

The next corollary gives another version of the approximation, essentially stating that for any achievable 
distortion vector, its K fold multiple is achievable using the separation approach. In terms of the genie- 
aided interpretation, the genie only needs to provide | log K bits common information to the users in the 
separation-based scheme, then it is as good as the optimal scheme. 

Corollary 3 If {Di, D2, . . . , Dk) G V{P,b), then {KDi, KD2, . . . , KDk)k ^ 'i>{P,b), where {xi,X2, ...,xk)k = 
(min(l, xi), min(l, X2), min(l, xk))- 

Theorem [1] and its corollaries provide approximate characterizations of the achievable distortion region, 
essentially stating that the loss of the source-channel separation approach is bounded by constants. The bound 
on the gap is chosen to be (largely) independent of a specific distortion distortion tuple on the boundary of 
T>{P, b), but it will become clear in the next section that such a choice is not necessary. 

The proofs of Theorem [1] and its corollaries heavily rely on the following outer bound, which is one of the 
main contributions of this work. 

Theorem 2 Let ti > T2 > . . . > tk-i be any K—1 non-negative real values, and tk = 0. If{Di, D2, ■ ■ ■ , Dk) G 
V{P,b), then 



h V n-=i(^.+r,) 



<P + Ni. (18) 



With the above theorem in mind, let us denote the set of distortion vectors satisfying (jl8p for a specific 
choice of ri > r2 > . . . > tk-i as ^(-P, 6, n, . . . , tk-i)-, i-e., 



V{P,b,ru...,TK-i) 

{Di,D2,...,DK):Y,^Nk 



K 



k=l 



(i+Tfc)n-=2(-Pj+^j-i) 



<P + Nu 



1>Di>D2>...>Dk>0 



(19) 



Thus Theorem [2] essentially states that 'D{P,b) C V{P,b,Ti, 
The following corollary is then immediate. 

Corollary 4 



, tr-i) for any valid choice of ti, r2, . . . , t/^-i- 



n >T2 > . . . >TK_ 1 >0 



(20) 
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Figure 2: Illustration of the outer bounds as ri = r varies, for a channel where A'^i = 10, = ^, P = 50, 
and 6 = 2. The solid blue lines are the outer bounds, and the dashed read line is the inner bound based 
on source-channel separation; the black circle gives the trivial outer bound with both users at the respective 
optimum in the one-to-one setting. For comparison, the achievable region using the proposed scheme in [8j is 
also given as the dotted black line. 

To illustrate Corollary HI let us consider the case K = 2 for which the bound involves only one parameter 
Ti. For this case, it can be shown through some algebra that this outer bound is equivalent to the one given 
in [8]. In Fig. [21 we illustrate the outer bounds for several specific choices of ri = r. For comparison, the 
achievable region using the proposed scheme in |8] is also given. Note that although the inner bound given by 
this scheme is extremely close to the outer bound, particularly for the two extreme cases, it appears they do 
not match exactly. 

It is worth emphasizing that we view this outer bound differently from the authors in [8]: for each possible 
value of Ti we view the condition (jlSp as specifying an outer bound for the distortion region {Di,D2); in 
contrast, the authors of [8] viewed the distortion D2 as being lower bounded by a function of Di, and the 
parameter ri was viewed as an additional variable that is subject to optimization, and subsequently only the 
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optimal choice of ti value was of interest. These two views are complementary, however the former view appears 
to be more natural for the -ftT-user problem, which also readily leads to the approximate characterization. In 
certain cases, the second view may be more convenient, such as when we are given a specific achievable 
distortion tuple, and wish to determine how much further improvement is possible or not possible. 

For K = 2, the properties of the outer bound were thoroughly investigated in [8]. In certain regime this 
outer bound in fact degenerates for the case of bandwidth compression and it is looser than the trivial outer 
bound with each user being optimal in the one-to-one setting. Due to its non-linear form, the optimization of 
this bound is rather difficult, and it also appears difficult to determine whether it is always looser than the trivial 
outer bound in all distortion regimes with bandwidth compression. Nevertheless, it is clear that this outer 
bound always holds whether the bandwidth is expanded or compressed, and the approximate characterizations 
are valid in either case. In fact, the outer bound in Theorem[T]can be straightforwardly improved by taking the 
intersection with the trivial point-to-point outer bound, leading to an outer bound which may yield improved 
approximate characterizations for the bandwidth compression case. In the remainder of this paper, we do not 
pursue such possible improvements, but instead only focus on the general settings. 

4 Proof of the Main Result for Gaussian Broadcast Channels 

The proofs of the main result for Gaussian source broadcast on Gaussian broadcast channels are given in 
this section. We start by establishing a simple inner bound for the distortion region 'D{P, b) based on source- 
channel separation, and then focus on deriving an outer bound, or more precisely a set of outer bounds. The 
approximate characterization is then rather straightforward by combining these two bounds. It will become 
clear in Section [5] that the Gaussian broadcast channel proof and results are simply a special but more explicit 
case of those on general broadcast channels. 

4.1 A simple inner bound 

The source-channel separation based coding scheme we consider is extremely simple, which is the combination 
of a Gaussian successive refinement source code and a Gaussian broadcast channel code; this scheme was 
thoroughly investigated in [TS], and a solution for the optimal power allocation was given to minimize the 
expected end-user distortion. Since Gaussian broadcast channel is degraded, the better user can always 
decode completely the messages sent to the worse users, and thus a successive refinement source code is a 
perfect match for this channel. Note that such a source-channel separation approach is not optimal in general 
for this joint source-channel coding problem; see for example [J. 

The Gaussian broadcast channel capacity is well known [T6] , which is usually given in a parametric form in 
terms of the power allocation. In this work, we will use an alternative representation, which first appeared in 
|17j and was instrumental for deriving the optimal power allocation solution in |15j . The Gaussian broadcast 

''We wish to thank Dr. Zvi Reznic for clarifying this point in a private communication. 
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channel capacity (per channel use), can be written in the following form. 



C 



K k 

{Ri,R2,...,RM):Rk>0,k = l,2,...,K, AiVfe exp 2 ^ i?^ ) < P + iV^ }> . (21) 

k=i \ j=i 



The rate R^ is the individual message rate intended only to the k-th user, however due to the degradedness, 
all the better users can also decode this message. Since the Gaussian source is successive refinable [18], by 
combining a Gaussian successive refinement source code with a Gaussian broadcast code achieving (|2ip . we 
have the following theorem. 

Theorem 3 

■D{P,b) OV{P,b). (22) 

Proof 1 We wish to show that any {Di, D2, ■ ■ ■ , Dk) G V{P,h) is indeed achievable. Using the separation 
scheme, we only need to show the rates (i?i, R2, ■ ■ ■ , Rk) specified by 

k 

Dk = exp{-2b^Rj), k = l,2,...,K, (23) 

i=i 

are achievable on this Gaussian broadcast channel. The non-negative vector {Ri, R2, . . . , Rk) is uniquely 
determined by (-Di, D2, . . . , Dx), and it is straightforwardly seen that it indeed satisfies the inequality in 112 
The proof is thus complete. 

4.2 An outer bound 

Next we derive a set of conditions that any achievable distortion vector {Di, D2, . . . , Dk) has to satisfy, i.e.. 
Theorem [2j 

Proof 2 (Proof of Theorem [2]) Let us first introduce a set of auxiliary random variables, defined as 

Uk = S + Vk, k = l,2,...,K-l, (24) 

where 's are zero Gaussian random variables, with variance t^, and furthermore 

Vk = Vk+i + ^Vk, k = l,2,...,K-l, (25) 

where AVk is a zero-mean Gaussian random variable, independent of everything else, with variance Ar^ = 
Tk ~ ''"fc+i- For convenience, we define Uk = S, which implies tk — 0; furthermore, define Uq = 0, i.e., being 
a constant. This technique of introducing auxiliary random variables beyond those in the original problem was 
previously used in U^. [7^ to derive outer bounds, and specifically in fj^ more than one random variable 
was introduced, whereas in only one was introduced. 
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For any encoding and decoding functions, we consider a quantity which hears a certain amount of sim- 
ilarity to the expression for the Gaussian broadcast channel capacity [2l\). and we denote this quantity as 
Ef^g{Ti,T2, . . . , tk-i) due to its sum exponential form 



K 



Ef,g{Ti,T2, TK-l) = ^ ATVfc 



exp 



k=i 



n 



Y,iiur;Yp\ur,uT,...,u. 



m ^ 



(26) 



The subscript (/, g) makes it clear that this quantity depends on the specific encoding and decoding functions. 
Next we shall derive universal upper and lower bounds for this quantity regardless the specific choice of functions 
{f,g), which eventually yield an outer bound for 'D{P,b) . 

Let {f,g) be any encoding and decoding functions that (asymptotically) achieve the distortions (Di, D2, ■ ■ ■ , Dk)- 
We first derive a lower bound for Ef^g{Ti, T2, . . . , tk-i)- Observe that for j = 2,3, . . . , K , 

I{Up; Yj^lUr, U^, ...,Up_,) = I(UJ- Y^) - I{UJI,;YJ^) > | log j j ^ , (27) 



where the equality is due to the Markov string {7™ ^ 

inequality is by LemmaUl Moreover, also by Lemma{l\ we have 



Yp , and the 



/(c/r;yn >y log -i±^, 

I L>i + Tl 



It follows that 



(28) 



^/(C/f;l^"|f/r,C/r,. 



i=2 

k 

log — 1 > lo 

2 Di + ri 2 ^ 

^ ^ i=2 



(l + r,_i)(D,-+r,) 



Summarizing the above bounds, we have 

Ef,g{Ti,T2, TK-l) > ^ AiVfc exp 



K 



k=l 



log ^ , + r 2^ lor 



Di+Ti b 



i=2 



(29) 



(30) 
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Next we turn to upper-hounding Ef^g{Ti,T2, ■ ■ ■ , tk-i), and first write the following. 



n 



i=i 



k 



-Y.[hiY;\U-,)-hiY;\Ur)] 



k 



-E^iYr\ur-i)--EhiYm 



T) 



j=i i=i 
Applying the entropy power inequality 119] for j = 1,2, . . . , K — 1, we have 



exp 



n 



'-h{Yp\Up 



> 



exp 



-KYjliPj 



n 



+ exp [log(27reAA'"j)] = exp 



n 



h{YJl,\U^ 



+ 27reA7V,-. 



For j = K , it is clear that 



exp 



'-h{Y^\u^: 



n 



exp 



n 



2-neNK = 2TTeANK. 



By defining exp [^/i(X"|5™)] = 0, it now follows that 



K 



Ef,giTi,r2, TK^i) = ^ AiVfc exp 



k=l 
K 



m 1 
3-^> 



< 



k=l 



We hound this summation, by considering the summands in the reversed order, i.e., k = K,K — 1, 
Starting with the summands when k = K — 1 and k = K , we have 



AN, 



K-r 



exp 


'^Ef=-,'HY-\ur-i) 


exp 

1 A 7\r_. 


'fEf=iM^"|f/;^i) 






exp(|/i(y/+jC7-)) + 27TeANj 




exp(|/i(y/+i|f/-)) + 2neANj 



exp 


'^Ef=iHYp\up_,) 






exp(fMy/+i|C/™)) + 2^eAiV,- 



ANk-1 + ANk 



exp 

2TreANK 



1 exp 


'_T^EfJiHYr\u^.,) 




2ne rfK-2 


exp(f /i(y/^,|?7f )) + 2TTeANj 
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Continuing this line of reduction, we finally arrive at the step when k = 1 as 



Ef,g{Ti,T2,...,TK-l) <ANr 



exp [^hiY{^)] 



exp [^h(Y{'\U^))] + 27reAiVi ^ 27re [expi^hiY^^'lU'^)) + 2T:e/^Ni 



1 exp 



exp [lh{Y^)] 
exp [f /^(ITl^r))] + 27reAiVi 
exp [lh{Y{^)\ 



exp [|/i(y2"l^r 

27re 



27re 



<P + iVi, 



where the last inequality is by the concavity of the log(-) function and the power constraint. 
Combining 130\) and 136\). it is clear that for any encoding and decoding functions (/, g) 



(36) 



P + Ni>Ef^g{n,T2,. 



K 



■ ,TK-i) > ^ AiVfcexp 
fc=i 



Di +Ti b ^-^ 



which completes the proof. 



(37) 



The meaning of the newly introduced random variable Uk can be roughly understood as the message 
meant for the k-th user. Under this interpretation, the term ^liUJT", Y^\UY^, C/™, . . . , UJP'_^) in the quantity 
Ef^g(Ti,T2, . . . ,Tx-i) essentially represents the individual rate intended for the k-th user in the Gaussian 
broadcast channel; this informal understanding provides the rationale for bounding Ef g{Ti,T2, . . . ,tk-i). 
This interpretation is nevertheless not completely accurate, and thus the outer bound is likely to be not tight 
in general, but suffices to provide an approximate characterization. 



4.3 The approximate characterization 

Now we are ready to prove Theorem [1] and its corollaries. 

Proof 3 (Proof of Theorem [1] and its coroUaries) The first inclusion 'D{P,b) C T){P,b) in Theorem\^is 
simply Theorem\^ and thus we focus on the other inclusion T>[P,b) C 'D*{P,b) OV^Pjb), for which we prove 
V{P, b) C P*(P, b) and V{P, b) C V{P, b) separately. From Theorem [3 it is clear that if {Di,D2, Dk) £ 
T>{P,b), then M^) holds with any Ti > T2 > ■ . . > tk^i > 0, and thus holds when we choose Tt = 
for k = 1,2, . . . , K — 1. It follows that the following condition has to be satisfied by any achievable distortion 
vector 



K 



k=l 



<P + Ni. 



(38) 



However, notice that 



K 



k=l 



n-=i(^,+^,) 



K 

>YANk 

k=l 



-.2Dj- 



n 



K 
k=l 



(39) 
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It now follows straightforwardly that any achievable distortion vector has to satisfy 

K 



Y.^Nki^'^Dk)--^ <P + Ni, (40) 



k=l 



and V{P, h) C P*(P, h) is proved. 

To prove 'D{P,b) C V{P,b), note again that any achievable distortion vector has to satisfy Theorem\^ thus 
we only need to prove 



1 ^ (i+Tfc)n'=2(-p.-+ ^.-1 



< ' k = l,2,...,K, (41) 



for some specific choice of ti > T2 > ... > t/^-i > 0. We first consider the case that Di < 1/K; the case 
that Di > 1/K will be discussed shortly. We take an induction approach, for which the following auxiliary 
quantities are needed 

a. = ^^^, fc=l,2,...,i^-l. (42) 

We claim that there exist ti > T2 > . . . > tk-i > such that |^ j[ ) holds with equality for k = 1,2, . . . , K — 1, 
and holds for k = K with equality or inequality; moreover with these Tk's we have < {K — k)~^ . First 
consider the case k = 1, since the function 

^k{T) = (43) 

P>k+T 

is monotonically decreasing and continuous in the range [0, oo], as long as 

$i(0) = — > > 1 = <l>i(oo), (44) 

^ ' Di - KDi - 1^ ^' ^ ' 

there exists a unique solution of ti such that (^Tp holds with equality. This is indeed true, which gives 



{K - 1)D 

Tl = 

It follows that 



^ 1^1(1 + ri) ^ Dijl-Di) ^ Di ^ _J_ 

D2 + T1 D2{l-KDi) + {K-l)Di- {K-l)Di K-1 ^ ' 

and thus our claim holds for k = 1. Next suppose the claim is true for k = k* and we prove it is also true 
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k = k* + 1, for which we wish to find r^.+i such that 

\k*+l/ 



1 _ {^ + Tk*+i)UU {^j+r,-i 



ik* 



(1 + Tfc,) Uj=2iDj + Tj-l) (1 + Tfc, + i)(Z?fc. + i + Tk*)_ 

) 

(47) 



KDk* (1 + Tk*){Dk*+i + Tfc.+i) ' 

where the last equality is by the supposition that the claim holds true for k* . Again by the monotonicity and 
continuity of <l>fe(r), as long as the choice of {ti,T2, . . . , r^*) satisfies 

there exists a valid solution Tk*+i in [0, r^*] for ( [^7[ j to hold. The second inequality in clearly true, and 

thus we only need to consider the first inequality. However, notice that 

Dk*{'i + Tut) ah* 11 . . 

< 7 ^ < , (49) 



L>fc.+i(Z)fc.+i + Tfc.) Dk*+i {K-k*)Dk*+i Dk-'+i 
and we thus conclude that there indeed exists a valid solution of Tk*+i for |77| ), or more precisely 

Tk*+i = 7^ (50) 

Ok* - T>k* + l 

To bound ak*+i, we write 

Dfc. + l(l + Tfc. + l) Dk* + l{l + Tk* + l) 
Dk*+2+Tk*+l Tk^+l 

^ ak*{l-Dk*+i) ^ ak' ^ {K-k*)~' ^ 1 .... 
1-ak* - 1-ak* ~ 1- {K -k*)-^ K - k* - 1' ^ ' 

where the last inequality is by the monotonicity of x/{l — x) in [0,1), and the fact a^* < {K — k*)^^ . The 
induction is thus complete. It only remains to check that when k = K the inequality I^I] ) still holds, for which 
we have 

WU^D, + r,_i) _ (1 + TK-i) + r,-i) Dk + tk-i 



Dk Uf=-i{D, + r,) nf=l'P. + r,) (1 + rK-i)DK 



""^^^^-^ ' >-i-, (52) 



KDk-i (1 + tk-i)Dk aK-iKDK " KDk ' 

where the last inequality is by the fact ax-i < 1- We have thus proved that T>{P,b) C 'D(P,b) for the case 
Di < 1/K. 
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Next we briefly discuss the case Di > D2 > • • • > > 1/K > -D^+i > • • • ^ D^, and we shall prove that 
(41^ holds for some ti > T2 > ■ ■ ■ > tk-i > 0. Notice that by choosing ti,T2, ■ ■ ■ ,Tr sufficiently large, we can 
make 



- > 



1,2, 



(53) 



because of the strict inequality given in Di > D2 > • • • > Dr > 1/K. We have also 
Dk, and thus there exist r^+i > • • • > tk-i such that 



K-r 



>^> Dr+1 >...> 



< 



(54) 



holds with equality for k = r + l,r + 2, — 1 and with either equality or inequality for k = K. Since 

we can choose ti, T2, . . . , sufficiently large, it is clear that indeed this set of tj. 's makes |^ j[ j hold for k = 
r + l,r + 2, . . . , K . This completes the proof for 'D{P, b) C V^P, b). 

To prove Corollary \^ we need to choose different values for (ri, r2, . . . , rx-i). Essentially, when the 
condition 2Dk < D^^i is not satisfied for certain k, we will choose to ignore the contribution of D^^ in the 
outer bound by taking an appropriate value ofr^. For convenience, define the set B = {k : Bj. = 1}, where is 
the labeling function given before Theorem\^ denote the member of B in an increasing order as ki,k2, ■ ■ ■ , k^Q^, 
where \B\ is the cardinality of the set B. The value of Tk 's are set by the following recursive formula 



Tk 



Dk ifBk = l 
Tk-i ifBk = 



k = l,2,...,K -I, 



(55) 



where we have defined tq = 1 for convenience. 

Note that Tk+i < Tk for k = 1,2, . . . , K — 2. It follows from Theorem\^ that this achievable distortion vector 
has to satisfy 



K 



k=l 



i^+rk)UU^D,+T,^i) 



n-=iP.+T,) 

\B\ 



N,-Nk, + J2iNk,-Nk,^,] 



i=l 



(i+Ajn;=2(A,+A,_j 



IY,=i{Dk,+D, 



<P + Ni 



(56) 



where for convenience we define k^i3\+i = K + 1; to see the equality holds, note that the terms for k ^ B are 
combined with the terms of k (z B, because this choice of {ti,T2, ■ ■ ■ ,tk-i) cancels out some of terms in the 
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numerator and the denominator. This implies that the distortion vector has to satisfy 



\B\ 



i=l 



\B\ 



(57) 



i=l 



k=l 



where the last two equalities are by the definition of . The proof can now he completed by applying Theorem 



Corollary directly follows from Theorem [Jl and no additional proof is necessary. 

5 Gaussian Source on General Broadcast Channels 

In this section, we show that the results for sending Gaussian source on Gaussian broadcast channels can be 
conveniently extended to general broadcast channels, which was inspired by a recent work by Avestimehr, 
Caire and Tse [H]. 

The broadcast channel is now given by an arbitrary conditional distribution P{Yi,Y2, ...,Yk\X), in the 
alphabets {yi,y2, ■ ■ ■ ,yK, X). For brevity, we omit repeating the definition of the codes here. To distin- 
guish from the Gaussian channel case, we denote the achievable distortion region as T>g{b), with a band- 
width mismatch factor b. For the separation-based scheme, we shall consider combining successive refinement 
source codes with broadcast codes with degraded message set [20] . Particularly, for an arbitrary permutation 
vr : {1,2,..., K} —i- {1,2,..., K}, the degraded message set requirement implies that there are a total of K 
independent messages {Wi, W2, . . . , Wk), such that the user 7r{k) should decode the messages Wi, W2, . . . , W^. 
For an arbitrary permutation vr, let us denote the achievable distortion by the separation approach of combin- 
ing successive refinement source code with the broadcast code with degraded message set as 'Dg{b), and the 
overall achievable region by this separation approach is given by 



Clearly the convex closure of the above region is also achievable, however such generality is not required. 

It is worth noting that since the characterization of broadcast capacity with degraded message set is still 
an open problem for K > 2, we do not have a characterization for the region T>g(b). However, if the broadcast 
channel is degraded, then only one permutation needs to be considered; moreover, if the capacity region of 
broadcast channel (with degraded message set or it is a degraded broadcast channel) is known, such as for the 
Gaussian broadcast channel case, the region "Dgib) can then be completely characterized. 

Now we present the counterpart of Corollary [H Corollary [2] and Corollary [3] for general broadcast channels. 

Corollary 5 Let (DJ", . . . , D^) be the relaxed distortion vector of {Di, D2, ■ ■ ■ , Dk). If {Di,D2, . . . , Dk) G 








(58) 



TT 



Vg{b), then {Dl,D*2,...,D*j,) eVg{b). 
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Corollary 6 For an infinite number of users indexed by x with inf{£)__.|g-pg(f)) iiif{Da;} ^ '^min > 0' i^x} 
the relaxed distortion vector of {Dx}. If {Dx} G Vg{h), then {-D^} € Vg{h), and furthermore, sup{-^} < 

Corollary 7 If {Du D2, . . . , Dk) e Vg{P,b), then {KDi, KD2, . . . , KDk)^ e Vg{P,b). 

We only prove Corollary [7] here, since the proof of Corollary [5] and Corollary [2] are similar. 

Proof 4 Assume a distortion vector (Di, D2, ■ ■ ■ , Dk) is indeed achievable with certain coding scheme. Let 
us view the induced random mapping from 5™ to 3^f, i = 1,2,..., K by this joint source channel code as a 
super-broadcast-channel; without loss of generality, let us assume Di > D2 > • • • > Dk > 0. 

We pick up the story from {21) and h2^} . and claim that there exists a degraded message set broadcast code 



on the super-broadcast- channel with asymptotic rate per each length-m block 



2 " Di+Ti 
^ 2 S(l + r,_i)P,+r,, 



^r = ?log^^^^^%^, J = 2,3,...,K. (59) 



By Ili27\ ) and i28\). we claim that the distribution Ui^,U^, . . . ,U^,X"^ can be used to construct a well- 
known super-position code with the above rates 119]. This code needs to span over n' blocks, and note that the 
induced super-broadcast- channel is block-wise memoryless. We only need to confirm that receiver k can decode 
all the messages up to the k-th layer by successive decoding. Observe that since 

indeed receiver k can decoder the first layer code, and thus recover the codewords based on Ui. The above 
inequality essentially shows that the channel to is more powerful than that to Yi , with the given channel 
input distribution. For the j-th layer where j < k, we have 



m . (1 + Tj){Dk + r,_i) ^ m ^ (1 + Tj){Dj + t,_i 
(14 

because of the monotonicity of the function 



f{D) = (62) 



D + T, 



when Tj_i > Tj, and the fact < Dj. Thus our claim is indeed true. 

Using this set of degraded message set broadcast channel codes, we can achieve ( or more precisely, approach 
arbitrarily close to) the following distortion for any ti > T2 > . . . > tk-i > 0, and tk — 0, 



k 



m (1 + Tfc+i) 11^=2 {Dj + Tj-l) 



=1 
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However, in the proof of TheoremUi we have already showed that there exist such t's that ( f^ip holds, i.e., 

Dl<KDk, k = l,2,...,K, (64) 
and the proof is complete by the fact that the achievable distortion region is a closed set. 

6 Concluding Remarks 

We derived a new outer bound to the achievable distortion region for the joint source-channel coding problem 
of sending a Gaussian source over a Gaussian broadcast channel with bandwidth mismatch. When combined 
with a simple separation-based achievability result, this new bound leads to approximate characterizations of 
the distortion region within some universal constant multiplicative factors. These results are further extended 
to the case of Gaussian source broadcasting on general broadcast channels. 

The outer bound was not fully optimized, which seems to be a difficult problem by itself. Depending on 
the specific achievability scheme, it may be beneficial to investigate the outer bound more thoroughly. In the 
current work, we only considered the separation-based scheme that yields approximate characterizations. 

The technique used in this work can be applied to the problem of multi-source broadcasting on more 
complex communication networks under certain conditions. We believe that similar results hold for many 
classes of suitably well-behaved networks, and this is part of our on-going work. 

The outer bounding technique of introducing auxiliary random variables used in this work is inspired by 
those used in [12^181113]. We believe this technique is also promising for other multi-user information theoretic 
problems. The role of the auxiliary random variable introduced in [12\ [8] was not quite well understood or 
interpreted previously, and may even appear mysterious to many researchers less familiar with the specific 
problems being treated. The current work (see also [13]) has made the meaning of the introduced random 
variables explicit. More specifically, they are introduced to substitute the messages in the pure channel coding 
problem. By this substitution, the quantities representing rates are replaced by the corresponding information 
quantities. With the interpretation made clear in a general manner, it is our hope that this technique can 
inspire other meaningful results in the future. 



20 



A Proof of Lemma [T] 



Proof 5 Define Z' = V + V' and Z = V, and thus U' = S + Z' and U = S + Z . To prove the first statement, 
we consider the following chain of inequalities 



I{W; U'"") = mh{U') - h{U"^\W) 

= mh{U')-h{S'^ + Z"^\W) 
= mh{U') - hiS"" + Z'™ - 

> mh{U') - hiS"" + ZX" - S"") 
(b) ™ 

> mh{U') - + Z'{i) - S{i)] 



i=l 



> mh{U') 



rn ^ 

- 5^ - log {(27re)E[(5(i) + - S{t))']] 

i=l 

m ^ 

h{U') - E 9 [(27re)(Ed(5(i), S{i)) + r') 



i=l 



where S"^ is the reconstruction with W , and its i-th position is denoted as S{i). The inequality (a) is because 
conditioning reduces entropy, (b) is because of the chain rule for differential entropy and the fact that condi- 
tioning reduces entropy, and (c) is because Gaussian distribution maximizes the differential entropy for a given 
second moment. Since log(-) is a concave function, we have 



log (27re)(Ed(5(i),5(i))+r') 



i=l 



m , 



27re (Ed(5",5") + r' 



It follows 



I(W; U'"") > mh{U') - y log 2^e (Ed(5'^, S~^) + r' 



777 

> mh{U') - — log [2ne{D + r') 
m 1 + r' 



which is the first claim in the lemma. 

To prove the second claim, we write the following 



liW; U"") - I{W; U'"") = mh{U) - mh{U') + h{U"^\W) - /i([/'^Iiy). 
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For the latter two terms, we have 



h{U"^\W) - h{U'^\W) h{U"^\W) - h{U'^\W) 

h{U"^\W) - h{U'"'\V"^ ,W) 
= I{U'"'-V''"'\W), 



> 



where (a) is because is independent ofW^ and W; (b) is by the definition ofU'. Continuing the chain of 
inequalities, we have 

(b) 

> h{V"") - h{V''^\S'^ - 5"" + + V"^) 

m 

^ [hiv'ii)) - h[r{i)\s{i) - s{i) + v{i) + v'ii))' 

i=l 
m 

= ^i{v'{iy,sii)-sii) + vii) + v'{i)) 

i=l 

Wy^lj^ Ed{Sii),S{i))+T' 

(e) m , D + t' 

^ 2-^"^^T7' 

where (a) is because V' is independent ofW; (h) is because conditioning reduces entropy; (c) is by applying the 
chain rule, and the facts that V'"^ is an i.i.d. sequence and conditioning reduces entropy; (d) is by applying the 
mutual information game result that Gaussian noise is the worst additive noise under a variance constraint, 
and taking V'{i) as channel input; finally (e) is due to the convexity and monotonicity of log inx E (0, oo) 
when r' > T > 0. This completes the proof for the second claim. 
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